Apache Hive Reviews & Ratings 2024

Overview

What is Apache Hive?

Apache Hive is database/data warehouse software that supports data querying and analysis of large datasets stored in the Hadoop distributed file system (HDFS) and other compatible systems, and is distributed under an open source license.

Recent Reviews

TrustRadius Insights

December 15, 2023

Apache Hive is a versatile software that has been widely used across various departments and organizations for different use cases. It has …

With Apache Hive, you can enter the world of Big Data

8 out of 10

July 06, 2022

On-premises large data processing is handled by Apache Hive, which is running on Cloud ERA Servers. In order to use Apache Hive, you must …

Best Distributed Database in the market

6 out of 10

April 19, 2022

Incentivized

We use Apache Hive to store a large set of data, which are huge documents such as problem statements and its answer, not only submitted by …

Help your dev team !

8 out of 10

April 12, 2022

Incentivized

We build our data lake and perform queries on large amounts of data. We group data from multiple sources into a common structure, making …

Spectacular SQL-like interface for accessing Hadoop

9 out of 10

April 11, 2022

Incentivized

To manage and view Apache Hadoop data in a SQL-like format To be able to query databases across the organization, quickly To query data …

This system makes active data of value.

8 out of 10

April 09, 2022

Incentivized

We have used the system to migrate data either for new versions or because we will use another operating program, the software helps us to …

Best query platform for ETL.

6 out of 10

April 08, 2022

Incentivized

I used Apache Hive on top of Hadoop for filtering and cleaning data using SQL. It was the part of the project which I was working on. …

It is an advance to the ease of the processes

8 out of 10

April 08, 2022

Incentivized

The software is intuitive from the first steps, one of the first features we take into account for the software does not allow duplicate …

Capabilities of Apache Hive

8 out of 10

April 07, 2022

Incentivized

Main purpose for using Apache Hive was to get the insights from data. Analyzing the data and use it to take informed business decisions. …

Excellent bigdata warehouse solution

9 out of 10

April 07, 2022

Incentivized

Apache Hive is an open-source data warehouse solution built on top of Hadoop that helps to analyze a very large amount of data.
Our use …

very useful for OLTP

10 out of 10

April 06, 2022

Incentivized

We use Apache to process large data and get the output with less process time. The framework is very much useful for data processing and …

Apache Hive

9 out of 10

November 24, 2021

Incentivized

1. Used Apache Hive to create external and internal tables in Hadoop / BigData projects on Cloudera and Azure platforms. 2. Apache Hive …

Walk into the World of Big Data with Apache Hive

9 out of 10

June 02, 2021

Incentivized

We are using Apache Hive over an on-premise big data setup built on top of Cloud ERA Servers. Use case behind using Apache Hive [it] is …

Reliable and Cheaper one stop Data warehouse solution

9 out of 10

December 28, 2020

Incentivized

I have used Apache Hive in [the] last 3 companies and it's being used by the multiple departments spread across data analytics, …

Big Data the SQL way

8 out of 10

September 23, 2020

Incentivized

I am working as a Research Assistant where I have to process tons of data to produce appropriate findings. Our NLP lab used it for all its …

Apache Hive: Big data querying tool w/SQL interface, but slower, more costly computation

7 out of 10

September 21, 2020

Incentivized

We use Apache Hive to make data-driven decisions. It is used from finance to engineering to sales. It helps aggregate our massive data …

Read all reviews

Awards

Products that are considered exceptional by their customers based on a variety of criteria win TrustRadius awards. Learn more about the types of TrustRadius awards to make the best purchase decision. More about TrustRadius Awards

Return to navigation

Pricing

View all pricing

Apache Hive

N/A

Unavailable

What is Apache Hive?

Entry-level set up fee?

No setup fee

Offerings

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services

Would you like us to let the vendor know that you want pricing?

24 people also want pricing

Alternatives Pricing

ClicData

$79

per month

What is ClicData?

ClicData is a 100% cloud-based business intelligence platform that allows users to connect, process, blend, visualize and share data from a single place. As an automated platform, users are able to rely on the latest version of company data, to ensure users make the right decisions. Hundreds of…

retailMetrix

$399

per month per installation

What is retailMetrix?

RetailMetrix is a data analytics platform for retailers with the mission of enabling retailers to get value from their data. RetailMatrix processes and stores sales, labor and customer data using data warehouse technologies. Its dashboards and reports allows team to find the data that matters to…

Return to navigation

Product Demos

Apache Hive Hadoop Ecosystem - Big Data Analytics Tutorial by Mahesh Huddar

YouTube

Connecting Microsoft Power BI to Apache Hive using Simba Hive ODBC driver

YouTube

Discover HDP 2.1: Interactive SQL Query in Hadoop with Apache Hive

YouTube

Return to navigation

Product Details

About
Tech Details
FAQs

What is Apache Hive?

Apache Hive Technical Details

Operating Systems	Unspecified
Mobile Application	No

Frequently Asked Questions

Reviewers rate Usability highest, with a score of 8.5.

The most common users of Apache Hive are from Mid-sized Companies (51-1,000 employees).

Return to navigation

Comparisons

View all alternatives

Compare with

Reviews and Ratings

(97)

December 15th 2023

Community Insights

TrustRadius Insights are summaries of user sentiment data from TrustRadius reviews and, when necessary, 3rd-party data sources. Have feedback on this content? Let us know!

Business Problems Solved

Apache Hive is a versatile software that has been widely used across various departments and organizations for different use cases. It has proven to be particularly helpful in handling large datasets, migrating data between different operating systems, synchronizing programs, and fetching and generating product metrics. Users have found value in using Hive for data analytics, engineering, data science, product management, and IT-related tasks such as improving analysis of big datasets stored in Hadoop HDFS.

Furthermore, Apache Hive has simplified the process of filtering and cleaning data using SQL, reducing the learning curve for handling big data. It allows users to run SQL queries against data in Hadoop, enabling efficient analysis of large datasets without the need to learn a new language. Additionally, Hive has been utilized for building reports, analyzing data stored in the Hadoop file system, processing events gathered in HDFS, and converting them into parquet files for fast querying.

Overall, users have praised Apache Hive for its scalability, accessibility, and cost-effectiveness in storing and retrieving analytics data. It has provided an intuitive solution for storing large datasets, querying big sets of data using SQL, aggregating massive datasets into distilled information for data-driven decision making, and creating external and internal tables in Hadoop/BigData projects. With its ability to process both unstructured and structured data efficiently, Hive has become an essential tool for data analysts, engineers, and business analysts across organizations.

Attribute Ratings

Reviews

(1-8 of 8)

Sort By *

Companies can't remove reviews or game the system. Here's why

April 12, 2022

Help your dev team !

Verified User

C-Level Executive in Information Technology

Computer Software Company, 11-50 employees

Score 8 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

We build our data lake and perform queries on large amounts of data. We group data from multiple sources into a common structure, making it easy for our developers to perform complex queries without leaving the simple framework provided by SQL. Although the deployment is not easy, once we have the infrastructure, the work is greatly simplified.

Pros and Cons

Simplify query to devs
Organize data
Batch process

Deploy
Maintenance
Support

Likelihood to Recommend

It is great for laboratory environments and to start working with unstructured data about which we are not very clear about how we want to treat it. It also allows queries to be improved very quickly by allowing developers to work with SQL instead of map-reduce. As an improvement, in productive environments, troubleshooting is complicated and requires expert personnel.

April 08, 2022

It is an advance to the ease of the processes

Pablo Gonzalez

Internet Marketing Manager

MKTi México (Marketing & Advertising, 51-200 employees)

Score 8 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

The software is intuitive from the first steps, one of the first features we take into account for the software does not allow duplicate files to be stored. It is advanced software that through data the system constantly learns and develops. The first phase is very effective, the analysis and checking of the information are verified in detail.

Pros and Cons

The unification of the data will help to establish the commercial criteria.
We are sure that the data is protected

If you try to extract an excessive amount of data, the system will become slow
You may have the danger that the system collapses due to the amount of data

Likelihood to Recommend

In addition to the fact that the information is quickly accessible through the established security protocols, it has not helped us as users to maintain a fairly comfortable data processing flow, it is more profitable to process the data in batches, we have been able to unify data from different sources

April 07, 2022

Excellent bigdata warehouse solution

Verified User

Program Manager in Information Technology

Information Technology & Services Company, 201-500 employees

Score 9 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

Apache Hive is an open-source data warehouse solution built on top of Hadoop that helps to analyze a very large amount of data.
Our use case/scope is to work on a large data analytics project where the data frequency and velocity are very high. Apache Hive is very useful in processing both the unstructured and structured data in a seamless way. It help us in reducing to write complex queries as it is targeted to the SQL queries, we have a engineer team who are very proficient in writing SQL queries with the help of Apache Hive to process the big data.
We have identified no business issues using the solution.

Pros and Cons

Apache Hive supports external data tables.
Supports data partitioning to improve overall performance.
Apache hive is reliable and scalable solution.
Apache Hive supports writing ad-hoc queries as well.

Apache hive is not best suited for OLTP based jobs.
Sometimes we observed high latency rate while querying data.
Limitations on providing row-level data update.
Training materials needs improvements.

Likelihood to Recommend

Apache Hive is a data warehouse/ ETL solution that is being used for processing big data for analytics and visualizations. Apache Hive has great architecture that makes it very well suited for organizations.
The Metastore, is used for storing metadata for each table and its schema. The Driver operates as a controller for executions of the statements. Like other components such as Optimizer and CLI, Thrift Server are some components that enable the processing of big data transformation.

September 21, 2020

Apache Hive: Big data querying tool w/SQL interface, but slower, more costly computation

Kristjan Gannon

Senior Software Engineer

Sovrn Holdings, Inc. (Internet, 51-200 employees)

Score 7 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

We use Apache Hive to make data-driven decisions. It is used from finance to engineering to sales. It helps aggregate our massive data sets into distilled information.

Pros and Cons

Flexibility through schema on read
Familiar SQL like query language
Functions for complex queries and analysis

Slower processing than other tools on the market

Likelihood to Recommend

Apache Hive is useful for regularly reporting and analyzing data. In terms of ad-hoc analysis and debugging, the cycles can be quite long for querying, feedback, debugging queries, etc.

September 19, 2020

Manage data for your warehouse as strong as a beehive using Apache HIve!

Ananth Gouri

Assistant Professor

The National Institute of Engineering, Mysuru (Education Management, 501-1000 employees)

Score 9 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

As we all know that, Apache Hive sits on the top of Apache Hadoop and is basically used for data-related tasks - majorly at the higher abstraction level. I work as an Assitant Professor at NIE, Mysuru and I am a user of Apache Hive since the first time I taught Big Data Analytics as a PG Course to my students.
It was one of those technical sessions and I was supposed to demonstrate a word count program of a novel downloaded from the Project Gutenberg. I was successfully able to download the novel, load it into the Hadoop platform and execute a HiveQL (a SQL similar syntax used by Apache Hive) query to demonstrate for few unique words, their count, and related examples.

Pros and Cons

The capability to handle large amounts of data and its querying process.
A syntax similar to SQL is an added advantage.
An active developer support and community always ready to help.
Ease of usage.

Resource consuming sometimes. May be that I was using a larger object file.
Needs to add an update or a modify functionality. This has to be the minimilastic CRUD requirement.

Likelihood to Recommend

I would definitely recommend Apache Hive if sought by a colleague. Especially for people who are working at academic institutions, they can demonstrate programs like word count, tab count, space count, new lines count, and other related programs - with a basic setup of a HiveQL.

The only underlying problem could be that the Apache Hive is designed to run on the Apache Hadoop ecosystem. People who are not comfortable using a Linux tree structure based File System or even people who are not likely to use a Linux OS might not like to use Hive.

September 18, 2020

Apache Hive: SQL, open-source querying tool

Verified User

Analyst in Professional Services

Legal Services Company, 201-500 employees

Score 7 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

Our company primarily uses Apache Hive to manage our data warehouse by being able to query multiple databases. We partition our tables as well as monitor query performance on very custom data queries by using this hive. Hive is only used by our data analysts and an overseas data warehouse team with only a few shared licenses existing on our virtual machines.

Pros and Cons

Monitor query performance
Manage tables in the data warehouse
Uses standard SQL

UI is quite dated and not intuitive
Open-source, so does not have consistent updates or support
Not the most optimal for ETL processes

Likelihood to Recommend

Apache Hive is well suited for organizations looking for an initial tool to begin their process of managing their data warehouse as it is open-source and relatively easy to set up. This works well with some legacy systems and many consoles support this. While Hive used to be quite revolutionary, it has fallen behind many other tools that are more performant or specialized for managing DBs, writing queries, and partitioning tables.

September 13, 2017

Apache Hive Review

Sameer Gupta

Senior Data Analyst

SurveyMonkey (Internet, 501-1000 employees)

Score 8 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

Hive is currently being used across the entire analytics organization at SurveyMonkey. The business problem that we solve through it is, accessing/storing large data sets(typically logs), in a scalable and accessible place.

Pros and Cons

SQL like query engine, allows easy ramp up from a standard RDBMS
Scalability is great
If properly configured the data retreival is fantastic

The way we currently have it implemented is quite slow, but I believe that's more of our implementation
Joins tend to be slow

Likelihood to Recommend

I think Apache hive is great for a company just stepping into the big data realm. I think the fact that it's open source allows for a variety of tools to be integrated. The fact that it has HiveQL makes for a great transition from a standard RDMS to a big data tool. This can be very nice in terms of cost savings as the ramp up time for an analyst will be quite low.

December 22, 2014

Hive, a very powerful open source data warehouse solution.

Yinghua Hu

Senior Data Scientist

CARD.com (Financial Services, 11-50 employees)

Score 10 out of 10

Vetted Review

Verified User

Use Cases and Deployment Scope

Hive is used by data team to store the largest datasets of the company. Data is partitioned in Hive and can be queried by Impala.

Pros and Cons

Partition to increase query efficiency.
Serde to support different data storage format.
Integrate well with Impala and data can be queried by Impala.
Support of parquet compression format

Speed is slower compared to Impala since it uses map reduce

Likelihood to Recommend

Hive is a data warehouse and it does not allow for updates and deletions. If data needs to be updated frequently, it might not be the best storage solution for that purpose.

Return to navigation

Apache Hive

ClicData

retailMetrix

Apache Hive Hadoop Ecosystem - Big Data Analytics Tutorial by Mahesh Huddar

Connecting Microsoft Power BI to Apache Hive using Simba Hive ODBC driver

Discover HDP 2.1: Interactive SQL Query in Hadoop with Apache Hive

Presto

PostgreSQL

ClickHouse

Google BigQuery

MongoDB

Oracle Autonomous Data Warehouse

Oracle Exadata

SAP BW

SAP BW/4HANA

Cloudera Enterprise Data Hub

IBM Netezza Performance Server

OpenText Vertica

Cloudera Data Platform

1010data

Community Insights